Frame theory in directional statistics 



Martin Ehler^'^'*, Jennifer Galanis^'^ 

^National Institutes of Health, National Institute of Child Health and Human Development, 

Section on Medical Biophysics, Bethesda, MD 20892 
^ University of Maryland, Department of Mathematics, Norbert Wiener Center, College Park, 

MD 20742 

^Institute of Chemistry and The Fritz Haber Center, The Hebrew University of Jerusalem, 

Jerusalem, 91904, Israel. 



Abstract 

Distinguishing between uniform and non-uniform sample distributions is a com- 
mon problem in directional data analysis; however for many tests, non-uniform 
distributions exist that fail uniformity rejection. By merging directional statis- 
tics with frame theory, we find that probabilistic tight frames yield non-uniform 
distributions that minimize directional potentials, leading to failure of unifor- 
mity rejection for the Bingham test. Finally, we apply our results to model 
patterns found in granular rod experiments. 
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1. Introduction 

Observations that inherit a direction occur in many scientific disciplines. 
For example, directional data arise naturally in the biomedical field for protein 
structure, cell-cycle, and circadian clock experiments BBHSSQ. Fur- 



ther examples occur in statistical mechanics, where experiments containing only 



rod-shaped particles can develop complex directional ordering [20|, l2l|, |22|, |33|. 
A simple pattern change for rod shaped objects is the density-dependent, order- 
disorder phase transition [H, [s^] shown for macroscopic granular rod experi- 



ments 



2m[ in Fig. 1(a) and 1(b). Quantification of this transition relies on a 
principle component analysis (PCA) type measure that is linked with statistical 
mechanical theories [i^]. When applied to experimental samples whose rod ori- 
entations shift from uniform to unidirectional, this measure finds the dominant 
direction (director) and strength of rod ordering (order parameter). 
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In reality, however, experimental rod orientations are rarely unidirectional, 
and spatial distortions in the director field frequently occur. These distortions 
may result from fluctuations and/or other competing forces, like those exerted 
by the container boundaries in Fig. 1(c) and 1(d). Accurately quantifying rod 
orientations can, in fact, yield information about the collective behavior of rods. 



for example elastic properties |2l|, |22]. While accurate orientation measure- 
ments of molecular-sized rods require special techniques jlsl, recent advances in 
single molecule detection may make such measurements more widely accessible 
(^,[3]. For example, "labeled" rods can be inserted into various environments 
and serve as local directional sensors by aligning with the rod-shaped material 
around them. This technique can in principle be used in environments as com- 
plicated as a cell 0, [s^ and potentially uncover intricate patterns that require 
more sophisticated measures of directional order. 

The resulting complex patterning may reduce the value of the order pa- 
rameter, sometimes to the point that the sample is inaccurately classified as 
disordered. Fig. 1(d). To predict which multidirectional patterns cause such 
misclassifications, we merge directional statistics with frame theory. Frames 
have proven useful in fields like spherical codes, compressed sensing, signal pro- 
cessing, and wavelet analysis jH,]!, H, [13, 11, E3, H, 0, 16, 25]. A frame is a 



basis-like system that spans a vector space but allows for linear dependency, 
which can be used to reduce noise, find sparse representations, or obtain other 
desirable features unavailable with orthonormal bases. Tight frames even pro- 
vide a parseval type formula similar to orthonormal bases. Moreover, the frame 
concept has recently been generalized to probability distributions on the unit 
sphere [13l]. 

To analyze granular rod patterning, we consider statistical testing for direc- 
tional uniformity, focusing on the Bingham test. We characterize non-uniform 
sample distributions that lead to failure of rejection and find that these distri- 
butions are probabilistic tight frames. Since these frames are well-understood 
in terms of algebraic/geometric conditions [13], further synergistic effects may 
develop between directional statistics and frame theory. 



2. Directional statistics 

Common tests in directional statistics focus on whether or not a sample on 
the unit sphere S^~^ = {x G : ||x|| = 1} is uniformly distributed. Here, 
we concentrate on two elementary tests for uniformity, Rayleigh and Bingham. 
Given a discrete sample {xi}2=i C S^~^, we follow the textbook [ill and define 
the mean as 

1 

(1) 

n ^-^ 

1=1 

where the polar representation x = rxg splits the mean into a mean direction 
xq G S^~^ and a mean resultant length f = ||x||. The Rayleigh test rejects 



2 



the hypothesis of uniformity if f is large. More precisely, the asymptotic large- 
sample distribution of dnf^ under uniformity is distributed with an error 
0{n~^)^ while the modified Rayleigh statistic (1 — ^)dnf'^ + 2n(d+2) ^^^^^^ is 



distributed with an error 0{n ^) |3l[ ]. 



To describe the Bingham test, let a denote the uniform probability measure 
on the sphere with respect to the Borel sigma algebra B. We first observe that 
the second moments of a satisfy 



Jsd-i a 



where x = . . . , x^'^))'"' G M*^ and i,j = 1, . . . , d. Note that the Fisher (or 
scatter) matrix^ 



1 " 



2=1 



of a sample {x^}^^;^ C 5^"^ equals the matrix of second moments of the un- 
derlying counting measure. Recalling that the matrix of second moments of 
the uniform measure equals ^X^, the Bingham test rejects the hypothesis of 
directional uniformity of a sample if its Fisher matrix Tj^^.jn ^ is far from \'^d- 

In fact, the Bingham statistic ^^^^ n (trace (T|^^ |n ) — \) under uniformity is 

x\d-i){d+2)/2 distributed with an error 0{n~^), cf. pll ]. 

We say that the Bingham test is inconsistent when rejection of uniformity 
fails for a particular non-uniform sample distribution. Here, we focus on those 
distributions that are multi-modal, where a mode is a local maximum of the 
distribution's density. Other analysis tools have been customized to spherical 
and more general manifold data in H, 27, 



3. Frames 

A collection of points {xi}^^^ C is called a finite frame for if there are 
two constants < A < called lower and upper frame hounds, respectively, 
such that 

n 
i=l 

where (•, •) denotes the usual inner product on M^. A frame spans M^, and any 
finite spanning set is a frame 0]. A collection of points {xi}^^^ C is called 
a finite tight frame for if there is a positive constant A, such that 

n 

A\\xf ^^\{x,Xi)\^, for all x e R*^. 
1=1 
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Every finite tiglit frame gives rise to the expansion 



X = ^ ^^{x, Xi)xi, for all x G M^, (2) 

i=l 

which generalizes the Parseval formula for othonormal bases [8]. We define a 
finite unit norm tight frame (FNTF) for as a tight frame whose elements all 
have unit norm. According to 0], the tight frame bound A of a FNTF is nld. 
The collection {xi}^^^ C S^~^ is a FNTF iff its Fisher matrix equals 



The literature contains many FNTFs. For example, [35| shows a FNTF for M 
composed of 6 vectors, and {(cos(a/e)5 sin(a/e))^ : /c = 1, . . . , n} is a FNTF for 

iff Ylli=i^^^^^ — [23]. In fact, any set of m unit norm vectors in R^ can 
be converted to a FNTF by adding m{d — 1) extra vectors 0]. 

We recall probabihstic frames as introduced in Let M{B^ S^~^) denote 
the collection of probability measures on the sphere with respect to the Borel 
sigma algebra B. An element fi G M{B^ S^~^) is called a probabilistic unit norm 
frame for R^ if there are constants < A < B such that 



< / \{x,y)\^dfi{y) < B\\xf, for ah x G R"^. (3) 

If we can choose A = S in (|3]), then we call fi a probabilistic unit norm tight 
frame for R^, and A must be equal to ^ 131]. We then have 



d / {x^y)ydji{y)^ for all x G 



where the integral is vector valued. This generalizes ([2|), and a sequence of 
pairwise distinct vectors {xi}^^^ C S^~^ is a FNTF for R^ iff the normalized 
counting measure ^iJ^xi...,xn ^ probabilistic unit norm tight frame for R^. 



4. Joining frame theory and directional statistics 

As introduced in Section El the Rayleigh test rejects uniformity if the mean 
resultant length is far from 0, while the Bingham test rejects uniformity if 
the sample's Fisher matrix is far from ^X^^. We therefore call a probability 
measure fi on the sphere a i?a7//ez^/i-alternative if its mean ft = Jsd-i xdii{x) 
is and a Sin^/iam-alternative if Mij(fi) = ^^ij for all i,j = 1 . . . , d. We 
first characterize Rayleigh- and Bingham-alternatives in terms of maximizers 
and minimizers, respectively, of certain potentials. Subsequently, we provide a 
connection to probabilistic frames. 

Bjoerck verifies in 0] that, among all probability measures /i G A1(S, S^~^)^ 
the maximizers of the probabilistic Riesz-2-potential 

/ / \\x -yfdii{x)dfi{y) (4) 
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are exactly the zero mean probability measures. Therefore, the maximizers of 
(jll are the Rayleigh-alternatives. 

To characterize Bingham-alternatives, we introduce the directional force F 
between two points a and b on the sphere S^~^ as F{a,b) = 2|(a,6)|(a — b). 
The physical potential between a and b is |(a,6)p The minimizers of the 
probabilistic frame potential 

PFP(/i)=/ / \{x,y)\^di^{x)di^{y), (5) 



among all probability measures jj. G Ai{B^ S^~^)^ are said to be in equilibrium 
under the directional force^ or simply in directional equilibrium (3, [ij] . Natu- 
rally, the uniform distribution is in directional equilibrium [3]; however, other 
distributions with a mixture of well-defined modes can also be in directional 
equilibrium. In fact, the minimizers of the directional potential ([5|) are exactly 
those probabilit y m easures jj. G M.{B, S^~^) whose second moments satisfy 
Mij{ii) = L-'-^l therefore, the Bingham-alternatives. These min- 

imizers were characterized as the probabilistic unit norm tight frames for 
in [13]. The advantage of the latter characterization is that tight frames are 
well-understood in terms of algebraic as well as geometric conditions [1, d, 13]. 

Results in and the present work imply that the Bingham-alternatives 
with zero mean are the minimizers of the fractional frame- Riesz-2-potential 

Jsd-i Jsd-i \ {x,y)\^d^{x)d^{y) 
Jsd-i Jsd-i \\x -y\\'^diJ.{x)diJ.{y)' 

Hence, probabilistic unit norm tight frames with zero mean are both, Rayleigh- 
and Bingham-alternatives. 



5. Patterning of granular rods 

5.1. Order-disorder phase transition 

A collection of rod shaped particles can undergo an order-disorder phase 
transition that, in the simplest model, is controlled by entropy. At low rod den- 
sities, the maximal total entropy occurs when both rotational and translational 
entropies are independently maximized. Beyond a critical density, however, 
rotational entropy is sacrificed for significant gains in translational entropy, re- 
sulting in a phase transition from randomly (uniformly) rotated rods. Fig. 1(a), 
to directionally oriented rods. Fig. 1(b). A PC A- type method measures the av- 
erage rod direction (director) and strength of rod alignment (order parameter) 
that results from rotational entropy loss and is described in the following: Let 
Xi G S^~^ denote the direction of the z-th rod out of n total rods. For simphc- 
ity, let us assume that the alignment is measured in a plane, hence d — 2. The 
covariance type matrix 

1 

Q2 = - V2x,x7-X2 (7) 
n ^-^ 

1=1 
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is therefore used to determine the director. Since Q2 is symmetric, the eigenvec- 
tors form an orthogonal basis, where the nonnegative eigenvalue A corresponds 
to the order parameter and the associated eigenvector corresponds to the di- 
rector, cf. [l9|, \2M\. In fact, < A < 1 and the second eigenvalue of Q2 equals 
-A. 

This PCA-type method only measures unidirectional rod ordering. In ex- 
periments, however, fluctuations and/or competing forces like container bound- 
aries. Figs. 1(c) and 1(d), can influence rod alignment. Therefore, the director 
field can vary spatially, potentially resulting in complex multidirectional pat- 
terning. Fig. 1(d), that sometimes cannot be distinguished from a disordered 
state when analyzed by the traditional order parameter. 




Figure 1: Rod patterns observed in 2D. (top) Images from granular rod experiments showing 
a disordered state (a), unidirectional ordered states with small (b) and large (c) standard 
deviations, and a multidirectional ordered state (d). The order parameters derived from 
Q2 in (0 are 0.0950 (a), 0.8902 (b), 0.4267 (c), and 0.1534 (d). The Bingham test rejects 
uniformity at a 1% confidence level for (b) and (c); however, uniformity cannot be rejected 
at a 5% confidence level for (a) and (d). Color represents the local order parameter, where 
Q2 is measured only in a vicinity of less than one rod length from each location in the image, 
(center) Directional histograms (blue) and directors Zi (dashed red lines) associated with each 
state, (bottom) We apply the Watson mixture model in (|9} with a suitable k. 
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5.2. New model by means of directional statistics 

We propose a complementary analysis of rod ordering that can more ac- 
curately quantify multidirectional alignment. We fit a probability model to 
experimentally observed rod patterning by first identifying a set of directions, 
cf. Fig. 1(b)- (d). Since a rod rotated by 180 degrees is indistinguishable from an 
unrotated rod, we do not differentiate between x and —x. Consistent with this 
criteria, let a e M{B, S^~^) be the uniform measure on the sphere, zq G S^~^^ 
and 7^ 0, the Watson measure ji is then given by 

m(^) = Crf(^) Qyiv{i^{z^,x)^)a{x), (8) 

where c^(a^) = 9^d/2^n /9^j/9^>t ; ^ the usual Gamma function, and F a confluent 



hyper geometric function (3l|]. For Ai: > 0, the density tends to concentrate 
around izg, whereas for < 0, the density concentrates around the great 
circle orthogonal to zq- And as |/^| increases, the density peaks tighten. 

Next, we model each sample with a mixture of Watson distributions, i.e., 
for a collection of directors {zi}f^^ d , v^e consider 

m(^) = ^^X]exp(/^(z^,x)2)a(x). (9) 

1=1 

By replacing von Mises measures in [3] with Watson measures, we conclude 
the following: if {zi}iL^ is a FNTF for then, for any 7^ 0, the measure ^ 
is a probabilistic unit norm tight frame for R^, hence a Bingham-alternative. 
Note that in M^, a FNTF with three elements (directions) must be equiangular, 
and all equiangular tri-directions are FNTFs [231]. Nevertheless, if a sample is 
distributed so that the modes in ([9]) approximate a minimizer, like the three 
nearly equiangular directions in Fig. 1(d), then the Bingham test may also fail 
to reject uniformity, as is the case with this figure. 

Finally, this approach extracts two parameter sets from the sample, direc- 
tional modes and associated widths [H, [l3, 26], where the widths represent a 



measure for directional ordering. These parameters will be used in a forthcom- 
ing paper to quantify the differences between experimental rod patterns and 
the expected behavior from theories and simulations. Furthermore, these tools 
may provide a method to identify more subtle rod patterning transitions, like 



the one described in |2l[ ] 
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